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Abstract. In comparative genomics, a transposition is an operation that 
psj exchanges two consecutive sequences of genes in a genome. The trans- 

J> position distance, that is, the minimum number of transpositions needed 

O to transform a genome into another, is, according to numerous studies, a 

^ relevant evolutionary distance. The problem of computing this distance 

when genomes are represented by permutations, called the Sorting by 
Transpositions problem, has been introduced by Bafna and Pevzner |3] 
X/^ in 1995. It has naturally been the focus of a number of studies, but the 

Q computational complexity of this problem has remained undetermined for 

c/^ 15 years. 

, In this paper, we answer this long-standing open question by proving that 

the Sorting by Transpositions problem is NP-hard. As a corollary of 
our result, we also prove that the following problem [8] is NP-hard: given 

— . a permutation tt, is it possible to sort tt using (i;,(7r)/3 permutations, where 

rfb(7r) is the number of breakpoints of tt? 

\ Introduction 

Along with reversals, transpositions are one of the most elementary large-scale operations that 
can affect a genome. A transposition consists in swapping two consecutive sequences of genes or, 

^ equivalently, in moving a sequence of genes from one place to another in the genome. The transpo- 

^ sition distance between two genomes is the minimum number of such operations that are needed 

H to transform one genome into the other. Computing this distance is a challenge in comparative 



genomics, since it gives a maximum parsimony evolution scenario between the two genomes. 

The Sorting by Transpositions problem is the problem of computing the transposition 
distance between genomes represented by permutations. Since its introduction by Bafna and 
Pevzner [3l H], the complexity class of this problem has never been established. Hence a num- 
ber of studies [H El [151 113 CZl 13 E] aim at designing approximation algorithms or heuristics, the 
best known fixed-ratio algorithm being a 1.375-approximation [12]. Other works |16 t l8t[T3 l ll9 | ll2 t [5] 
aim at computing bounds on the transposition distance of a permutation. Studies have also been 
devoted to variants of this problem, by considering, for example, prefix transpositions [UlEOlIT] (in 
which one of the blocks is a prefix of the sequence) , or distance between strings [9l [ini [Ml [22l [18] 
(where multiple occurences of each element are allowed in the sequences), possibly with weighted 
or prefix transpositions [HI [6l [H [21 [7] . 
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vr = 



(tToTTi . . . TTi-l TTj . . . TTj^i TTj . . . Trfc-l TTfc . . . 7r„) 
(tToTTi . . . TTj-lTTj . . . TTk-l TTj . . . TTj-lTTk ■ ■ . 7r„) 



Figure 1: Representation of a transposition Tij^k, with 0<i<j<k<n. 



In this paper, we address the long-standing issue of determining the complexity class of the 
Sorting by Transpositions problem, by giving a polynomial time reduction from SAT, thus 
proving the NP-hardness of this problem. Our reduction is based on the study of transpositions 
removing three breakpoints. A corollary of our result is the NP-hardness of the following problem, 
introduced by [8j: given a permutation vr, is it possible to sort tt using db{iT)/3 permutations, where 
dft(7r) is the number of breakpoints of vr? 

1 Preliminaries 

1.1 Transpositions and Breakpoints 

In this paper, n denotes a positive integer. Let [a ; = {x G N | a < x < 6}, and Idn be the 
identity permutation over [[0 ; nj . We consider only permutations of [0 ; nj such that and n 
are fixed-points. Given a word ui U2 ... ui, a subword is a subsequence Up^ Up^ . . . Up^,, where 
1 < Pi < P2 < • • • < W < A factor is a subsequence of contiguous elements, i.e. a subword with 
Pk+i =Pk + '^ for every G [I ; /' - Ij. 

A transposition is an operation that exchanges two consecutive factors of a sequence. As we 
only work with permutations, it is defined as a permutation Tj j^, which, once composed to a 
permutation vr, realise this operation (see Figure [T]). The transposition Tij^k is formally defined as 
follows. 

Definition 1 (Transposition). Given three integers i,j,k such that 0<i<j<k<n, the 
transposition Uj^k over [0 ; nj is the following permutation (we write q{j) = k + i — j): 



For any < x < i, 
For any i < x < q{j), 
For any q{j) < x < k, 
For any k < x < n, 




X 




X + j — i 
X + j — k 



X 



Note that the inverse function of Tj^- ^ is also a transposition. More precisely, t- ■ ^ 
The following two properties directly follow from the definition of a transposition: 




Property 1. Let r = Tj j be a transposition, q{j) = k + i — j , and n, f G [[0 ; nj be two integers 
such that u < V. Then: 



t{u) > t{v) i < u < q{j) < V < k 
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Property 2. Let r he the transposition t = Tij^i^, and write q(j) = k + i — j . For all x G [[1 ; nj, 
the values of t{x — 1) and t~^{x — 1) are the following: 









-1) = 


t{x) - 


1 








Vrc ^ {i, 




-1) = 


T-i(x) 




1 






-1) = 






1) = 


r~ 


-Hi) 


- 1 




-1) = 


r(A;) - 1 




1) = 


t' 


'\k) 


- 1 


T{k 


-1) = 


r(i) - 1 




1) = 




-1(0 


- 1 



Definition 2 (Breakpoints). Let ir be a permutation o/ [0; nj. // x G [[1 ; nj is an integer such 
that 7r(x — 1) = 7r(x) — 1, then (x — l,x) is an adjacency of tt, otherwise it is a breakpoint. We 
write di,{7r) the number of breakpoints of tt. 

The following property yields that the number of breakpoints of a permutation can be reduced 
by at most 3 when a transposition is applied: 

Property 3. Let tt be a permutation and t = Tij^k be a transposition (with < i < j < k < n). 
Then, for all x £ ll; nj — k}, 

{x — l,x) is an adjacency o/vr 44> {t~^{x) — 1,t~^{x)) is an adjacency of tt or. 

Overall, we have d^i^TT o r) > dbiir) — 3. 

Proof. For all x € [I ; nj — {i,j, k}, we have: 

(x — 1, x) adjacency of vr <J4> 7r(x — 1) = tt{x) — 1 

^ 7r(r(r"Hx - 1))) = 7r(T(r"i(x))) - 1 

<^ TT o t{t^^{x) - 1) = vr o T(r"i(x)) - 1 by Prop. [2] 

(t^1(x) — 1, T^i(x)) adjacency of vr o r. 

□ 

1.2 Transposition distance 

The transposition distance of a permutation is the minimum number of transpositions needed to 
transform it into the identity. A formal definition is the following: 

Definition 3 (Transposition distance). Let it be a permutation o/ [[0 ; nj. The transposition dis- 
tance dt{iT) from TT to Ldn is the minimum value k for which there exist k transpositions ti,T2, ■ ■ ■ ,Tk, 
satisfying: 

vr o Tfc o . . . o r2 o n = Ldn 
The decision problem of computing the transposition distance is the following: 

Sorting by Transpositions Problem [3] 
Input: A permutation tt, an integer k. 
Question: Is dtiir) < k? 

The following property directly follows from Property [3| since for any n the number of break- 
points of Ldn is 0. 

Property 4. Let tt be a permutation, then dt{TT) > db{TT)/3. 

Figure [2] gives an example of the computation of the transposition distance. 
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TT = 2 4 31 5 

TT o ri,3,5 = 3 1 2 4 5 

o ri_3^5 o Ti^2,4 = 1 2 3 4 5 

Figure 2: The transposition distance from vr = (0 2 4 3 1 5) to Id^ is 2: it is at most 2 since 
'''"o ''"1,3,5° ''"1,2,4 = -^^^5) and it cannot be less than 2 since Property applies with (ife(7r)/3 = 5/3 > 1. 



2 3-Deletion and Transposition Operations 

In this section, we introduce 3DT-instances, which are the cornerstone of our reduction from SAT 
to the Sorting by Transpositions problem, since they are used as an intermediate between 
instances of the two problems. We first define 3DT-instances and the possible operations that can 
be applied to them, then we focus on the equivalence between these instances and permutations. 

2.1 3DT-instances 

Definition 4 (3DT-instance). A 3DT-instance / = (S,T, ^/^) of span n is composed of the following 
elements: 

• S.- an alphabet; 

• T = {{ai,bi,Ci) I 1 < i < \T\}: a set of (ordered) triples of elements ofE, partitioning E (i.e. 

ITI 

all elements are pairwise distinct, and \^iJi{ai,bi, Ci} = T,); 

• -0 : S — 7- [[1 ; nj , an injection. 

The domain of I is the image of ip, that is the set L = | cr € S}. 

The word representation of I is the n-letter word ui U2 ■ ■ - Un over S U {■} (where ■ ^ SJ, such 
that for all i £ L, ip{ui) = i, and for i G [I ; nj — = ■. 

Two examples of 3DT-instances are given in Example [T] Note that such instances can be 
defined by their word representation and by their set of triples T. The empty 3DT-instance, in 
which E = 0, can be written with a sequence of n dots, or with the empty word e. 

Example 1. 

In this example, we define two 3DT-instances of span 6, / = {T,,T,ip) and /' = {Y^' ,T' 

I = ai C2 bi b2 ci 02 with T = {(ai,6i,ci), (02,62,02)} 
/' = ■ 62 ■ C2 ■ 02 with T' = {(a2, &2, C2)} 

Here, / has an alphabet of size 6, E = {ai, 61, ci, 02, 62; C2}, hence ^p is a bijection (V'(ai) = 1, 
■0(c2) = 2, V(6i) = 3, etc). The second instance, I', has an alphabet of size 3, E' = {a2,b2,C2}, 
with %b'{b2) = 2, iP'{c2) = 4, V'(a2) = 6. 

Property 5. Let I = {T,,T,ip) be a 3DT-instance of span n with domain L. Then 

|E| = ILI = 3|r| < n. 
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Proof. We have |S| = \L\ since ^ is an injection with image L. The triples of T partition S so 
|S| = 3|T|, and finally L C [1 ; nj so \L\ < n. □ 

Definition 5. Let I = (S,T, ^/^) be a 3DT-instance. The injection tp gives a total order over S, 
written -<j (or -<, if there is no ambiguity), defined by 

VfJi, fT2 G 0-1 ^/ (72 <^ ip{cri) < ip{cr2) (1) 

Two elements ai and a2 of S are called consecutive if there exists no element x G S such that 
ai -</ X 02- In this case, we write ai </ 02 (or simply ui <\ a2)- 

An equivalent definition is that ai -< (J2 if cJi 02 is a subword of the word representation of 
/. Also, a\ < (72 if the word representation of / contains a factor of the kind a\ ■* 02 (where ■* 
represents any sequence of / > dots). 

Using the triples in T, we define a successor function over the domain L: 

Definition 6. Let I = (S,T, -0) be a 3DT-instance with domain L. The function succj : L ^ L is 
defined by: 

V(a,6,c)GT, il^ia) ^ i;{b) 
ip{b) I-)- Vic) 
'ip{c) I— 7- ip{a) 

Function succj is a bijection, with no fixed-points, and such that succj o succj o succj is the 
identity over L. In Example [l| we have: 

A 2 3 4 5 6\ , /2 4 6\ 

^^^^^= VS 6 5 2 1 4J ^^^^ ^"^^^'=^4 6 2)- 

2.2 3DT-steps 

Definition 7. Let I = (S,T, ^) be a 3DT-instance, and {a,b,c) be a triple ofT. Write i = 
min{^(a), -(/^(c)}, j = succi{i), and k = succi{j). The triple (a, 6, c) £ T is well-ordered if we 
have i < j < k. In such a case, we write T[a,b,c,^] the transposition Tij^k- 

An equivalent definition is that (a, 6, c) is well-ordered iff one of 06c, bca, cab is a subword 
of the word representation of /. In Example [T| (ai,6i,ci) is well-ordered in I: indeed, we have 
i = tp{ai), j = V'(^i) and k = V'(ci), so i < j < k. The triple (a2,62iC2) is also well-ordered in /' 
(i = 'ip'{b2) < j = ip'{c2) < k = ip'(a2)), but not in /: i = V'(c2) < k = 'i/'(&2) < j = ^^{^2). In this 
example, we have T[ai, 61, ci, V'] = ^^3^5 and r [02 , 62 , C2 , -0'] = ^2,4,6- 

Definition 8 (3DT-step). Let I = {Tj^T^ifj) be a 3DT-instance with {a,b,c) £ T a well-ordered 
triple. The 3DT-step of parameter (a, 6, c) is the operation written ^' , transforming I into the 
3DT-instance I' = {12', T',^') such that: 

• S' = S — {a, b, c} 

. r = T-{{a,b,c)} 
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{a,b,c) 



X 
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Y 


c 


Z 



w 




Y 



X 



Z 



Figure 3: The 3DT-step 



(a, b, c) 



has two effects, here represented on the word representation of a 



3DT-instance: the triple (a, b, c) is deleted (and replaced by dots in this word representation), and 
the factors X and Y are swapped. 



A 3DT-step has two effects on a 3DT-instance, as represented in Figure |3j The first is to 
remove a necessarily well-ordered triple from T (hence from S). The second is, by applying a 
transposition to "0) to shift the position of some of the remaining elements. Note that a triple 
that is not well-ordered in / can become well-ordered in or vice- versa. In Example [T| I' can be 
obtained from I via a 3DT-step: ji^^ii^^iifl)^/'. Moreover, I' (°^'^^''^^)^ g_ A more complex example 
is given in Figure [4j 

Note that a 3DT-step transforms the function succj into succj' = o succj o r, restricted 
to L' , the domain of the new instance /'. Indeed, for all (a, b, c) £ T' , we have 

succii{'4)' [a)) = tp'{b) 

= r-\m) 

= {succi{ip{a))) 
= {succi{t{iI)' (a)))) 
= {t~^ o succi o T){^l^'{a)) 

The computation is similar for ^l^'{b) and ip'{c). 

Definition 9 (3DT-collapsibility). A 3DT-instance I = {Ti^T^ijj) is 3DT-collapsible if there exists 
a sequence of 3DT-instances Ik, Ik-i, ■ ■ ■ , lo such that 

h = I 

h = e 

In Example [l| I and /' are 3DT-collapsible, since / (ai.&i.ci)^ j/ (a2,b2,c2)^ g^ Another example is 
the 3DT-instance defined in Figure |4j Note that in the example of Figure |4j there are in fact two 
distinct paths leading to the empty instance. 
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ai e a2 h d 62 ci / C2 



(ai,5i,ci) (a2,&2,C2) 



d 62 ■ e 02 ■ / C2 



oi e ■ ci / ■ 61 d 



(rf,e,/) 

1 

a2 ■■ 62 ■■ C2 



(d,e,/) 

1 

fli ■■ bi ■■ ci 



(02,62,02) (ai,6i,ci) 



Figure 4: Possible 3DT-steps from the instance / defined by the word ai e 02 61 (i 62 ci / C2 
and the set of triples T = {(ai, 61, ci), (02, 62, C2), (d, e, /)}. We can see that there is a path from / 
to e, hence / is 3DT-collapsible. Note that both (ai,6i,ci) and (a2,62,C2) are well-ordered in the 
initial instance, each one loses this property after applying the 3DT-step associated to the other, 
and becomes well-ordered again after applying the 3DT-step associated to (d, e, /). 



2.3 Equivalence with the transposition distance 

Definition 10. Let I = (S,T, ^/j) be a 3DT-instance of span n with domain L, and tt be a permu- 
tation of [[0 ; nj . We say that I and vr are equivalent, and we write 7 ~ vr, if: 

7r(0) = 0, 

yvell;nj-L, TT{v) = Tr{v-l) + l, 

Vu G L, 7r(w) = Tr{succJ^{v) — 1) -|- 1. 

With such an equivalence I ~ vr, the two following properties hold: 

• The breakpoints of vr correspond to the elements of L (see Property [6]). 

• The triples of breakpoints that may be resolved immediately by a single transposition corre- 
spond to the well-ordered triples of T (see Figure [s] and Lemma [s]). 

Property 6. Let I = (E, T, ip) be a 3DT-instance of span n with domain L, and tt be a permutation 
of [[0 ; nj, such that / ~ tt. Then the number of breakpoints of tt is ^^(vr) = \L\ = 3\T\. 



Proof. Let v G [[1 ; nj . By Definition 10 , we have: 

If V ^ L, then Tr(v) = tt{v — 1) + 1, so {v — l,v) is an adjacency of vr. 

If f G L, we write u = succj (v), so tt{v) = 7r(u — 1) -|- 1. Since succj has no fixed-point, we 
have u ^ V, which implies n{u — 1) / tt{v — 1). Hence, 7r{v) / 7r{v — 1) -|- 1, and {v — l,v) is a 
breakpoint of vr. 

Consequently the number of breakpoints of vr is exactly \L\, and \L\ = 3\T\ by Property [5] □ 
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succi : 

u : 
7r(u) : 




i—1 i 



■K{k)-1 Tr{j) 



k^l k 
n(i) — l n{k) 



Figure 5: Illustration of the equivalence I ~ vr on three integers k) such that j = succi{i) and 
k = succi{j). It can be checked that Ti{v) = ^{u — 1) + 1 for any {u,v) G {(«, j), {j,k), {k,i)}. 

I : ai a2 as 62 C3 61 63 ci C2 T = {(o^, 6^, | 1 < z < 3} 
(ai,&i,ci) /^7r:06 4 8 7 2 1539 



t\ I': ■ 63 ■ a2 03 62 C3 . C2 T' = {(ai,6j,Ci) I 2 < z < 3} 
7r':01 56487239 



Figure 6: Illustration of Lemma [?} since / ~ vr and ji^l^hi^r, then /' ~ vr' = vr o T, where 
r = r[ai,6i,ci,V']- 



With the following lemma, we show that the equivalence between a 3DT-instance and a per- 
mutation is preserved after a 3DT-step, see Figure [6} 

Lemma 7. Let I = {T,,T,iIj) be a 3DT-instance of span n, and n be a permutation 0/ [[0 ; nj, such 
that / ~ vr. // there exists a SDT-step I , then I' and vr' = vr o r, where r = r[a, b, c, ip], are 

equivalent. 

Proof. We write k) the indices such that r = Tjj^fc (i.e. i = min{'i/;(a), ^(6), -0(6)}, j = succi{i), 
k = succi{j)). Since (a, 6, c) is well-ordered, we have i < j < k. 

We have /' = (S',r',^'), with S' = S -{a,6,c}, T' = T - {(a, 5, c)}, and %b' : a ^ T~^{^{a)). 
We write respectively L and L' the domains of / and /'. For all w' E [[1 ; nj, we have 

v'eL' ^ 3a e^- {a,b,c}, v' = T'^{%b{a)) 
^ t{v') G L - {i,j,k} 



We prove the 3 required properties (see Definition |10[) sequentially: 

• 7r'(0) = vr(r(0)) = vr(0) = 0, 

• Vu' G [[1 ; nj — L', let v = t{v'). Since v' ^ L', we have either v G k}, 01 v ^ L. In the 
first case, we write u = succj^{v) (then u G {i, j, k}). By Property 2| r~^(ii — 1) is equal to 
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r ^{succi{u)) — 1, so r ^{u — 1) = r ^{v) — 1. Hence, 

tt'{v' - 1) + 1 = Tr{T{T-\v) - 1)) + 1 
= 7r(M - 1) + 1 



= 7r{v) by Def. 10, since v G L and v = succi{u) 
= tt'{v') 

In the second case, v ^ L, we have 

7r'{v' - 1) + 1 = 7r(T(r~i(t;) - 1)) + 1 

= ■7r{T{T~^{v — 1))) + 1 by Prop. [2| since v ^ {i, j, k} 
= 7r(7; - 1) + 1 



= Tr{v) by Def. 10, since v ^ L 
= tt' {v') 

In both cases, we indeed have 7r'(u' — 1) + 1 = tt'{v'). 

• Let v' be an element of L' . We write v = t{v'), u = succj^{v), and u' = t~^{u). Then 
v' = t~^{succ[{t{u'))) = succj'{u'). Moreover, v ^ /c}, hence u ^ {i,j,k}. 

7r'{u' - 1) + 1 = 7r(r(r~^(u) - 1)) + 1 

= tt{t{t^^{u — 1))) + 1 by Prop. [2j since u ^ {i, j, k} 
= 7r{u - 1) + 1 

= 7r(t;) by Def. [Toj since u G L and u = succj^{v) 
= nir{r-\v))) 
= 7r'{v') 

□ 

Lemma 8. Let I = (S,T, ^/j) be a 3DT-instance of span n, and tt a permutation of [0; nj, such 
that I ^ TT. If there exists a transposition r = Tij^k such that db{TT or) = db{TT) — 3, then T contains 
a well-ordered triple (a, b, c) such that r = r[a, 6, c, ip]. 

Proof. We write i' = r~^(i), j' = T~^{j), and k' = T~^{k). Note that i < j < k. 

Let tt' = TT OT. For all x G [[1 ; nj — {i,j,k}, we have, by Property [sj that {x — l,x) is an 
adjacency of tt iff (r^^(x) — l,r^^(x)) is an adjacency of tt' . Hence, since db{TT') = ^^(vr) — 3, we 
necessarily have that (i — 1, i), {j — and {k — 1, k) are breakpoints of tt, and (i' — 1, i'), (/ — l,j') 
and (k' — l,k') are adjacencies of vr'. We have 

7r(z) = 7r(r(i')) 
= 7r'(i') 

= 7r'(i' — 1) + 1 since (i' — 1, i') is an adjacency of tt' 

= ^'(T-l(f)-l) + l 

= TT'{T-^{k - 1)) + 1 by Prop. [2] 
= 7r(A; - 1) + 1 
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Since / ~ vr and i ^ k, hy Definition 10, we necessarily have i £ L (where L is the domain of /), 
and i = succi{k). 

Using the same method with (j' — and {k' — l,k'), we obtain j,k £ L, j = succi{i) 

and k = succi{j). Hence, T contains one of the following three triples: {ip~^{i),ip~^{j),ip~^{k)), 
{ij''^{j),7p-^{k),i;-^{i)) or {ip^^ (k) , ip~^ (i) , (j)) . Writing (a, 6, c) this triple, we indeed have 
Ti,j,k = ''"[O; ^5 c, ^p] sincc i < j < k. □ 

Theorem 9. Let I = (S, T, ip) be a 3DT-instance of span n with domain L, and ir be a permutation 
of [[0 ; nj, such that / ~ vr. Then I is 3 DT- collapsible if and only if dt{'K) = \T\ = (ifc(vr)/3. 

Proof. We prove the theorem by induction on /c = |T|. For /c = 0, necessarily I = e and L = T = 0, 



and by Definition 10, vr = Idn (■7r(0) = 0, and for all v > 0, tt{v) = vr(f — 1) + 1). In this case, I is 
trivially 3DT-collapsible, and (ij(vr) = = |T| = (ife(vr)/3. 

Suppose now k = k' + 1, with k' > 0, and the theorem is true for k' . By Property [6| we have 
db{T^) = S/c, and by Property [4| df(vr) > 3k/3 = k. 

Assume first that / is 3DT-collapsible. Then there exist both a triple (a, 6, c) G T and a 
3DT-instance I' = (E' ,T' such that and that /' is 3DT-collapsible. Since T' = 

T—{{a,b,c)}, the size ofT' is k — 1 = k' . By Lemma[7| we have I' ~ vr' = vror, with r = r[a, b, c, 
Using the induction hypothesis, we know that dt(vr') = k' . So the transposition distance from 
vr = vr' o to the identity is at most, hence exactly, k' + 1 = k. 

Assume now that dt{Tr) = k. We can decompose vr into vr = vr' o r~^, where r is a transposition 
and vr' a permutation such that dj(vr') = k — 1 = k' . Since vr has 3A; breakpoints (Property |6]) , and 
vr' = vr o r has at most 3A; — 3 breakpoints (Property |4| , r necessarily removes 3 breakpoints, and 
we can use Lemma [sj there exists a 3DT-step I , where (o, 6, c) G T is a well-ordered triple 

and r = rfa, 5, c, We can now use Lemma [Tj which yields /' ~ vr' = vr o r. Using the induction 
hypothesis, we obtain that /' is 3DT-collapsible, hence I is also 3DT-collapsible. This concludes 
the proof of the theorem. □ 

The previous theorem gives a way to reduce the problem of deciding if a 3DT-instance is 
collapsible to the Sorting by Transpositions problem. However, it must be used carefully, since 
there exist 3DT-instances to which no permutation is equivalent (for example, / = ai 02 61 62 ci C2 
admits no permutation vr of [[0 ; 6]] such that I ~ vr). 
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3 3DT-collapsibility is NP-Hard to Decide 



In this section, we define, for any boolean formula (/>, a corresponding 3DT-instance /<^. We also 
prove that is 3DT-collapsible if and only if (p is satisfiable. 

3.1 Block Structure 

The construction of the 3DT-instance uses a decomposition into blocks, defined below. Some 
triples will be included in a block, in order to define its behavior, while others will be shared between 
two blocks, in order to pass information. The former are unconstrained, so that we can design blocks 
with the behavior we need (for example, blocks mimicking usual boolean functions), while the latter 
need to follow several rules, so that the blocks can conveniently be arranged together. 

Definition 11 (/-block-decomposition). An /-block-decomposition B of a 3DT-instance I of span 
n is an l-tuple (si, . . . ,si) such that si = 0, for all h & ; I — Ij, Sh < Sh+i and si < n. We write 
th = Sh+i for h £ 11; I - Ij, and ti = n. 

Let I = {Ti,T,ip). For h € [1; ^1; the factor Ug^+i ""8,^+2 ■■■Ut^ of the word representation 
ui U2 ... Un of I is called the full block B'^^ (it is a word over S U {■} ). The subword of B'^ where 
every occurrence of • is deleted is called the block Bh- 

For a G S, we write blockj^Q{a) = h if jp{a) E [s/i + 1 ; th} (equivalently, if a appears in the 
word Bh)- A triple {a,b,c) £ T is said to be internal if blockifi{a) = blocki^sib) = blocki^is{c), 
external otherwise. 

If T is a transposition such that for all /i G [[1 ; /J, T(sh) < T{th), we write t[B] the l-block- 
decomposition (t(si), . . . ,r(si)). 

In the rest of this section, we mostly work with blocks instead of full blocks, since we are 
only interested in the relative order of the elements, rather than their actual position. Full blocks 
are only used in definitions, where we want to control the dots in the word representation of the 
3DT-instances we define. Note that, for (Ti,(T2 G S such that blockj^Q{ai) = blocki^B{a2) = h, the 
relation ai < G2 is equivalent to g\ G2 is a factor of Bh- 

Property 10. Let B = (si, . . . ,si) be an I -block- decomposition of a 3DT-instance of span n, and 
i,j,k £ ll; nj be three integers such that (a) i < j < k and (b) 3/io such that Sh^ < i < j ^ tho 
or Sh^ < j < k < tho (or both). Then for all /i G [[1 ; I}, T^^k{sh) < Tf^ki^h), o,nd the l-block- 
decomposition T,^^^f^[B] is defined. 

Proof. For any /i G [I ; ZJ, we show that we cannot have i < Sh < j < th < k. Indeed, Sh < j 
implies h < ho (since Sh < j < th^ = Sho+i)' ^'^'^ 3 ^ implies /i > /iq (since th^-i = Sh^ < j < th). 
Hence Sh < j < th implies h = ho, but i < Sh,th < k contradicts both conditions ShQ < i and 
k < thQ- hence the relation i < Sh < j ^ th < k is impossible. 

By Property [l| since Sh < th for all /i G [I ; /J, and i < Sh < j <th < k does not hold, we have 
\lk^'^h) < \lki^h)i which is sufficient to define t~J^^,[B]. □ 

The above property yields that, if (a, c) is a well-ordered triple of a 3DT-instance I = (E, T, if)) 
(r = r[a, 6, c, V']), and B is an Z-block-decomposition of /, then T~-'^[i3] is defined if {a,b,c) is 
an internal triple, or an external triple such that one of the following equalities is satisfied: 
blocki^s{_o) = blockifi{h), block j^sib) = block j^]3{c) or blocki^^c) = blockifi{a). In this case, the 
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3DT-step lt:M^r is written (/, ^'), where B' = t-^[B] is an /-block-decomposition 

of 

Definition 12 (Variable). A variable A of a 3DT-instance I = (S,T, -0) is a pair of triples 
A = [(a, 6, c), {x,y,z)] ofT. It is valid in an I -block- decomposition B if 

(i) 3ho G [1 ; /J such that hlocki^s(h) = blockj^is{x) = blockj^is{y) = ho 

(ii) 3hi £ ll; IJ, hi ^ ho, such that blockj^^la) = blocki^Q^c) = blocki^j^{z) = hi 

(iii) if X ~< y, then we have x <b<y 

(iv) a -< z -< c 

For such a valid variable A, the block Bhg containing {b,x,y} is called the source of A (we 
write source{A) = ho), and the block B^^ containing {a,c,z} is called the target of A (we write 
target{A) = hi). For /i S [[1 ; I}, the variables of which Bh is the source (resp. the target) are 
called the output (resp. the inputj of Bh. The 3DT-step ji^hlilXl' is called the activation of A (it 
requires that (x, y, z) is well-ordered). 

Note that since a valid variable A = [(a,6, c), {x,y,z)] has blocki^six) = blocki^is{y), its activa- 
tion can be written {I,B) ^'^'^''1 (1' ,B'). 

Property 11. Let {I,B) be a 3DT-instance with an l-block-decomposition, and A be a variable of 
I that is valid in B. Write A = [(a, 6, c), (x, y, z)]. Then {x,y,z) is well-ordered iff x ^ y; and 
(a, 6, c) is not well-ordered. 

Proof. Note that for ah a, a' G S, blockj^^io') < blocki^si'^') =^ a ^ a' . Write / = (S,r, V"), 



ho = source{A) and hi = target(A): we have /iq 7^ hi by condition (ii) of Definition 12 



If ho < hi, then, with condition (iv) of Definition 12, 6 ^ a ^ c, and either x ^ y ^ z ot 
y < X < z. Hence, (a, 6, c) is not well-ordered, and (x, y, z) is well-ordered iff x ^ y. 

Likewise, if hi < ho, we have a < c < b, and z<x<yoYZ<y<x. Again, (a,6, c) is not 
well-ordered, and (x, y, z) is well-ordered iff x -< y. □ 

Property 12. Let {I,B) be a 3DT-instance with an l-block-decomposition, such that the external 
triples of I = (S,T, -0) can be partitioned into a set of valid variables A. Let {d,e,f) be a well- 
ordered triple of I, such that there exists a 3DT-step (/, B) , B'), with I' = {T,' ,T' ,ip') . 
Then one of the two following cases is true: 

• {d,e,f) is an internal triple. We write ho = blocki^s{d) = blockj^Q{e) = blockf^is{f) . Then 
for all a G S', block['^ig'{a) = blocki^Q{a). Moreover if o'i,a2 G S' with blocks Qi{ai) = 
blockji^Q/{a2) 7^ ho and ai -</ ct2, then ai o"2. 

• 3A = [{a,b,c), {x,y, z)] G A, such that {d,e,f) = {x,y,z). Then blockji Qi{b) = target{A) 
and for all cr G S' — {b}, blockji ^jsi{a) = blockj ^is{a) . Moreover if ai,a2 G S' — {b}, such that 
c"! <72; then oi -<ii 02- 
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Proof. We respectively write r and i,j,k the transposition and the three integers such that r = 
Ti,j,k = T[d, e, f, tp] (necessarily, 0<i<j<A;<n). We also write B = (so, si, . . . , si). The triple 
(d, e, /) is either internal or external in B. 

If (d, e, /) is internal, with ho = blockj^i3{d) = blockj^Bie) = blockj^sif), we have (see Figure[7^ 

Sho < i < j < k < tho ■ 

Hence for all /i € [[1 ; I}, either Sh < i oi k < Sh, and T~^{sh) = Sh by Definition [1} Moreover, 
for all (T G S, we have 

i < V'(ct) < A; ^ ipia) € Isho + 1 ; thj and T"^(sft,J < i < r-^(V^(cj)) <k< r-^(t/,J 

=^ blocki^is{(7) = ho = blockji ^iS'{cr) 

il^{cr) < i or k < V'(fT) =^ r~"^('0(iT)) = ^(fx) 

=^ blockf ^iS'{a) = blocki^is{a) 

Finally, if c7i,(T2 € S' with blockji^Qi{ai) = blockji^Qi{a2) 7^ then we have both T~^(ip(ai)) = 
i/j{ai) and T~-^{xp{a2)) = ■0(c2)- Hence cii (12 44> cri CJ2. 

If (d, e, /) is external, then, since the set of external triples can be partitioned into variables, 
there exists a variable A = [(a, 6, c), (x, y, z)] G ^, such that (d, e, /) = (a, b, c) or (d, e, /) = {x, y, z). 
Since {d,e,f) is well-ordered in I, we have, by Property 11, {d,e,f) = {x,y,z) and x -<j y, 
see Figure [7]b. And since A is valid, by condition (iv) of Definition \l2\ x <j b <j y. We write 
ho = source{A) and hi = target{A), and we assume that ho < hi, which implies x y z (the 
case hi < ho with 2; ^/ x ^/ y is similar): thus, we have 

i = Tp{x), j = ^^{y), k = ip{z), and sh^ < i < j < thg < < k < th^. 

We define a set of indices U by 

U = {sh\hGll;lj}U {n} U {^{a) | a G S' - {b}}. 

We now show that for all u £ U, we have u < i or j < u. Indeed, if u = Sh for some /i G [1 ; /J, 
then either h < ho and u < s^g < i, or ho < h and j < t^^ < u. Also, if n = n, then j < u. Finally, 
assume u = tp{a), with a £ T,' — {b}. We then have x a y 4^ a = b, since x <i/ 6 <i/ y. Hence 
either o" -</ x and u < tpix) = i, or y -<i a and V'(y) = j < u. 

By Property [1] ii u,v G U are such that u < v, then t^^{u) < t^^{v). This implies that 
elements of S' — {6} = S — {6, x, y, z} do not change blocks after applying on ip, and that the 
relative order of any two elements is preserved. Finally, for b, we have x ~<j b ~<i y, hence 

i < ^{b) < j < Sh, < k < th,. 

Thus, by Property [l} r~^(shj < t"^(V'(6)) < T'^{th,), and blockr^s'ib) = hi = target{A). This 
completes the proof. □ 

Definition 13 (Valid context). A 3DT-instance with an I -block- decomposition {I,B) is a valid 
context if the set of external triples of I can be partitioned into valid variables. 

With the following property, we ensure that a valid context remains almost valid after applying 
a 3DT-step: the partition of the external triples into variables if kept through this 3DT-step, but 
conditions (iii) and (iv) of Definition 12 are not necessarily satisfied. 
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Property 13. Let {I,B) be a valid context and {I,B) 



{d,ej) 



{I',B') be a 3DT-step. Then the 



external triples of {I',B') can be partitioned into a set of variables, each satisfying conditions (i) 
and (ii) of Definition [ii 



Proof Let / = (S,r, V"), /' = {T,' ,T' ,ip'), A be the set of variables of /, and E (resp. E') be the 
set of external triples of / (resp. /'). From Property 
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two cases are possible. 



First case: {d,e,f) ^ E. Then for all a G S', blockp = blockj^is{a). Hence E' = E, and 
{I',B') has the same set of variables as {I,B), that is A. The source and target blocks of every 



variable remain unchanged, hence conditions (i) and (ii) of Definition 12 are still satisfied for each 
AeAin B'. 

Second case: {d,e,f) G E, and 3A = [{a,b,c), {x,y, z)] G A, such that {d,e,f) = {x,y,z), by 
Property 12 Then blockji^j3i{b) = target{A) and for all a £ T,' — {b}, blockp ^Qi{a) = blockjjsla). 



Hence blockj'^j3'{b) = blocki'^Q'^a) = blockj' ^]3'{c), and E' = E — {{x,y, z), {a,b, c)}: indeed, {x,y,z) 
is deleted in T' so {x,y,z) ^ E' , (a, 6, c) is internal in /', and every other triple is untouched. 
And for every A' = [{a',b',c'), {x',d',e')] £ A — {A}, we have blocki'^Q'{a) = blocki^s{a) for a S 
{a', 6', c', x', y', z'}, hence A' satisfies conditions (i) and (ii) of Definition [l2| in B' . □ 

Consider a block S in a valid context {I,B) (there exists h £ [l ; ^1 such that B = Bh), anc. 
(d, e, /) a triple of / such that (/, B) ^ Q') (we write B' = B'^. Then, following Property 

four cases are possible: 
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h ^ {blockj^is{d),blocki^is{e),blocki^is{f)}, hence B' = B, since, by Property 
order of the elements of B remains unchanged after the 3DT-step i^LJzIX. 

(d, e, f) is an internal triple of B. We write 
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the relative 



B (d,e,/) 



B' 



3A = [{a,b,c),{x,y,z)] such that h 
of B), see Figure [s] (left). We write 



source{A) and {d,e,f) = {x,y,z) {A is an output 



B 



>■ B' 



• 3A = [(a, 6, c), (x, y, z)] such that h = target{A) and (d, e, /) 
see Figure Is] (right). We write 



(x, y, z) {A is an input of B) 



B 



•A- 



*■ B' 



The graph obtained from a block B by following exhaustively the possible arcs as defined above 
(always assuming this block is in a valid context) is called the behavior graph of B. 
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Figure 7: Effects of a 3DT-step ^'^'^'^\ on an /-block-decomposition if (a) {d, e, /) is an internal 
triple, or (b) there exists a variable A = [(a, 6, c), (x, y, z)] such that {d,e,f) = {x,y,z). Both 
figures are in fact derived from Figure [3] in the context of an Z-block-decomposition. 
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source{A) 
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Figure 8: The activation of a variable A = [(a, 6, c), {x,y,z)] is written with a double arc in the 
behavior graph of the source block of A and with a thick arc in the behavior graph of its target 
block. It can be followed by the 3DT-step ^' , impacting only the target block of A. Dot 
symbols (■) are omitted. We denote by R, S, T, U, V, W some factors of the source and target blocks 
of A: the consequence of activating A is to allow U and V to be swapped in target(A). 




3.2 Basic Blocks 

We now define four basic blocks: copy, and, or, and var. They are studied independently in this 
section, before being assembled in Section |3.3[ Each of these blocks is defined by a word and a 
set of triples. We distinguish internal triples, for which all three elements appear in a single block, 
from external triples, which are part of an input/output variable, and for which only one or two 
elements appear in the block. Note that each external triple is part of an input (resp. output) 
variable, which itself must be an output (resp. input) of another block, the other block containing 
the remaining elements of the triple. 



We then draw the behavior graph of each of these blocks (Figures to 12): in each case, we 



assume that the block is in a valid context, and follow exhaustively the 3DT-steps that can be 



applied on it. We then give another graph (Figures 13 1 to 131), obtained from the behavior 
graph by contracting all arcs corresponding to 3DT-steps using internal triples, i.e. we assimilate 
every pair of nodes linked by such an arc. Hence, only the arcs corresponding to the activation 
of an input/output variable remain. From this second figure, we derive a property describing the 
behavior of the block, in terms of activating input and output variables (always provided this block 
is in a valid context). It must be kept in mind that for any variable, it is the state of the source 
block which determines whether it can be activated, whereas the activation itself affects mostly the 
target block. 

3.2.1 The block copy 

This block aims at duplicating a variable: any of the two output variables can only be activated 
after the input variable has been activated. 
Input variable: A = [{a,b,c),{x,y, z)] . 

Output variables: Ai = [(ai, 6i, ci), (xi, yi, zi)] and A2 = [(02, 62, C2), (x2, 2/2, -^2)]- 
Internal triple: (d, e, /). 
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Definition: 



[Ai,A2] = copy(^) 



a yi e z d y2 xi bi c X2 b2 f 



Property 14. In a block [^1,^2] = copy(j4) in a valid context, the possible orders in which A, Ai 
and A2 can be activated are {A, Ai, A2) and (74,^2,^1). 



Proof. See Figures l9] and [13 



□ 



3.2.2 The block and 

This block aims at simulating a conjunction: the output variable can only be activated after both 
input variables have been activated. 

Input variables: Ai = [{ai,bi,ci),{xi,yi, zi)] and A2 = [{a2,b2, C2), ix2,y2, ^2)]- 
Output variable: A = [{a,b,c), {x,y, z)]. 
Internal triple: (d, e, /). 
Definition: 



A = and(^i,^2) 



ai e zi a2 ci Z2 d y C2 X b f 



Property 15. In a block A = and{Ai, A2) in a valid context, the possible orders in which A, Ai 
and A2 can be activated are (Ai, A2, A) and (A2, Ai, A). 



Proof. See Figures 10 and [T3p. 



□ 



3.2.3 The block or 

This block aims at simulating a disjunction: the output variable can be activated as soon as any 
of the two input variables is activated. 

Input variables: Ai = [{ai,bi, ci), {xi,yi, zi)] and A2 = [(02, &2, C2), (x2, 1/2, ^;2)]- 
Output variable: A = [{a, b, c), {x, y, z)]. 
Internal triples: {a',b',c') and {d,e,f). 
Definition: 

A = or(^i,^2) 



a\ b' zi 02 d y a' x b f Z2 ci e c' C2 



Property 16. In a block A = or(Ai,A2) in a valid context, the possible orders in which A, Ai 
and A2 can be activated are (741,^,^2); [A2,A,A\), {Ai,A2,A) and (^2,^1,^)- 



Proof. See Figures 11 and[T3| 



□ 



3.2.4 The block var 

This block aims at simulating a boolean variable: in a first stage, only one of the two output 
variables can be activated. The other needs the activation of the input variable to be activated. 
Input variable: A = [(a, 6, c), (x, y, z)]. 

Output variables: Ai = [(ai,6i,ci), A2 = [(a2,^2,C2), {x2,y2,Z2)]. 

Internal triples: (di,ei,/i), (^2, 62,72) and {a',b',c'). 

Definition: 



[^1,^2] = var(A) 



di yi a d2 2/2 ei a' 62 xi bi fi c' z b' c X2 62 /2 
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a yi e z d y2 xi bi c X2 b2 f 



a yi e b d y2 xi hi c X2 b2 f 



(a,6,c) 

I 



d 2/2 xi bi yi e X2b2 f 



y id, if) 



dy2 e X2 b2 f\ 



X2 b2 y2 xi bi yi 



id,ej) y \^ 



A2 

\ 



X2 62 yi 



x\ bi yi 



A2 Ai 



Figure 9: Behavior graph of the block [^1,^2] = copy(j4). A thick (resp. double) arc corresponds 
to the 3DT-step for an input (resp. output) variable [(a, 6, c), {x,y,z)]. 
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ai e Zi a2 ci Z2 d y C2 X b f 



Ui e bi a2 ci Z2 d y C2 X b f ai e zi a2 ci b2 d y C2 x b f 



(ai,6i,ci) 



A2 Ai 



(a2,&2,C2) 

X 



a2 e Z2 d y C2 X b f 



ai e bi a2 ci b2 d y C2 X b f 



ai e zi d y ci X b f 



A2 



(ai,6i,ci) (a2,&2,C2) 



a2 e b2 d y C2 X b f 



ai e bi d y ci X b f 



(02,62,02) (ai,6i,ci) 



d y e X b f 



id,ej) 
I 



X b y 



A 



Figure 10: Behavior graph of the block A = and(Ai, A2) 
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ai b' zi a2 d y a' X b f zi C\ e c' ci 



'A. 



ai b' bi a2 d y a' X b f c\ e c' C2 a\ b' zi a2 d y a' x b f 62 ci e c' C2 



(ai,6i,ci) 



^4, 



(02,62,22) 



a2 d y a' X b f Z2 b' e c' C2 



ai b' bi a2 d y a' X b f 62 ci e c' C2 



ai b' zi ci e c' d y a' X b f 



'A. 



(ai,6i,ci) (02,62,02) 
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Figure 11: Behavior graph of the block A = or (^1,^2) 
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Property 17. In a block [^1,^2] = var(j4) in a valid context, the possible orders in which A, Ai 
and A2 can be activated are {Ai, A, A2), {A2, A, Ai) , (A, Ai, A2) and {A, A2, Ai). 



Proof. See Figures [12] and [T3| i. □ 

With such a block, if A is not activated first, one needs to make a choice between activating Ai 
or A2. Once A is activated, however, all remaining output variables are activable. 



3.2.5 Assembling the blocks copy, and, or, var. 

Definition 14 (Assembling of basic blocks). An assembling of basic blocks {I,B) is composed of 
a 3DT-instance I and an I -block- decomposition B obtained by the following process: 

• Create a set of variables A. 

• Define I = (J^jTjip) by its word representation, as a concatenation of I factors B\ B'2 ... B'^ 
and a set of triples T, where each B\^ is one of the blocks [^1, ^42] = copy(A), A = and(^i, ^2), 
A = or(^i,^2) or [^1,^2] = var(j4), with Ai,A2,A G A (such that each X €z A appears in 
the input of exactly one block, and in the output of exactly one other block); and where T is 
the union of the set of internal triples needed in each block, and the set of external triples 
defined by the variables of A. 

Example 2. 

We create a 3DT-instance / with a 2-block-decomposition B such that (/, B) is an assembling of 
basic blocks, defined as follows: 

• / uses three variables, A = {Xi, X2, Y} 

• the word representation of / is the concatenation of [Ai, A2] = var(y) and Y = or(Ai, A2) 

With Ai = [(ai,6i,ci), (xi,yi,zi)], A2 = [(02, ^2, C2), (a;2, ^2, 2:2)], Y = [{a,b, c), {x,y, z)], and 
the internal triples (di, ei, /i), ((i2, 62, /2), (a', fe', c') for the block var, and {a" ,b" , c"), {d, e, f) for 
the block or, the word representation of I is the following (note that its 2-block-decomposition is 
(0,18)): 

I = di yi a d2 y2 0! 62 xi bi fi c' z b' c X2 62 /2 b" zi a2 d y a" x b f Z2 ci e c" C2 



Indeed, a possible sequence of 3DT-steps leading from 7 to e is given in Figure 14, hence I is 
3DT-collapsible. 



Lemma 18. Let I' be a 3DT-instance with an I -block- decomposition B' , such that (/', B') is obtained 
from an assembling of basic blocks {I,B) after any number of 3DT-steps, i.e. there exist k > 
triples {di,ei,fi), i G [[1 ; kj, such that (/, i3)i^lifliA), . . . i^^if^(I', ^'). 

Then {I',B') is a valid context. Moreover, if the set of variables of {I',B') is empty, then I' is 
3DT-collapsible. 

Proof. Write A the set of variables used to define (/, B). We write / = (S, T, ip) and /' = (S', T' , ip'). 
We prove that (I', B') is a valid context by induction on k (the number of 3DT-steps between (/, B) 
and {I' ,B')). We also prove that for each h \1 ; l\, B'^ appears as a node in the behavior graph 
oiBh. 
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di yi a d2 y2 ei a' 62 xi h fi c! z h' c xi 62 h 



(di,ei,/i) 



a! 62 x\ bi yi a 6,2 2/2 c' z b' c X2 62 /2 



(c'2, 62,72) 



a' 62 a ^2 2/2 c' z b' c X2 62 /2 



di a xi hi fi c' z b' c X2 62 2/2 ei a' 



A 



di yi a d2 2/2 ei a' 62 xi hi fi d hh' c X2 62 /2 



(di,ei,/i) 



a' 62 xi hi 2/1 a d2 y2 c' b b' c X2 62 /2 



^ I 

Ai (a, 6, 6) 



(^2, 62,72) 



(ii 2/1 a a;i 61 /i c' z b' c 61 a' 



(a,6,c) 



di 2/1 a xi bi fi d hb' c X2 62 2/2 61 a' 



a' 62 a d2 y2 c' b b' c X2 62 /2 



(a, 5, c) 



(a, 6, e) A2 



di yi b' d2 2/2 61 a' 62 xi bi fi c' X2 62 /2 



(di,ei,/i) 



(£^2,62,72) 



di 2/1 a cci 5i /i d b b' c 61 a' 



(a, &, c) 



a' 62 xi 61 2/1 fo' ^2 2/2 e' X2 b2 /2 



di 2/1 6' xi bi fi d X2 62 2/2 61 a' 



A2 



a' 62 6' (i2 2/2 d X2 62 /2 



(a',6',c') 



(a',5',c') 



di 2/1 6' xi bi fi d 61 a' 



{a'M,d 



d2 2/2 62 xi bi 2/1 2:2 ^2 /2 



di 2/1 a;2 ^2 2/2 61 xi bi fi 



(^2,62,72) 



(di,6i,/i) 



ia\b',d) 



A. 



d2 2/2 62 X2 62 /2 



2^1 bi 2/1 2:2 62 2/2 



(^2,62,72) 



Ai A2 



d\ 2/1 61 xi 5i /i 



(rfl,6l,/l) 



X2 b2 2/2 



x\ bi 2/1 



Figure 12: Behavior graph of the block [^1,^2] = var(j4). 
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A^or{Ai,A2) 



[A,,A2]=variA) 





(c) 



(d) 



Figure 13: Abstract representations of the blocks copy, and, or, and var, obtained from each 
behavior graph (Figures [9|[lO}[Tl]and[l2| by contracting arcs corresponding to internal triples, and 
keeping only the arcs corresponding to variables. We see, for each block, which output variables 
are activable, depending on which variables have been activated. 
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I = \ di yi a d2 y2 ei a' 62 xi bi /i c' z b' c X2 62 /2 | ai zi a2 d y a" x b f Z2 ci e c C2 | 

\. ((ii,ei,/i) Internal triple of Si 

/lo = I a' 62 cci 61 yi o d2 2/2 d z b' c X2 62 /2 | ai 6" Zi a2 d y a" x b f Z2 c\ e c" C2 | 

J, (xijt/ijZi) Activation of Xi 

/g = I a' 62 a ^2 J/2 z b' c X2 62 /2 | Oil bi a2 d y a" X b f Z2 C\ e c" C2 | 

\. (ai, 61, Ci) Internal triple of B2 

Is = \ a! €2 a d2 j/2 z b' c X2 b2 f2 \ 0,2 d y a" x b f Z2 h" e c" C2 \ 

i {a", b", c") Internal triple of B2 

Ir = \ a' €2 a d2 y2 c' z b' c X2 b2 f2 \ a2 d y e X b f Z2 C2 \ 

4, {d, e, /) Internal triple of B2 

h = \ a! €2 a d2 y2 c' z b' c X2 b2 f2 \ 0-2 x b y Z2 C2 | 

4, {x, y, z) Activation of Y 

h = \ a' ^2 a d2 y2 c' b b' c X2 62 /2 | 0,2 Z2 C2 \ 

4, (a, b, c) Internal triple of Bi 

14 = \ a' 62 b' d2 y2 c' X2 62 /2 | 02 Z2 C2 I 

4, (a', 6', c') Internal triple of Bi 

h = \ d2 y2 e.2 X2 62 /2 | 02 2:2 C2 | 

i (^2, 62, 72) Internal triple of B\ 

I2 = \X2 b2 y2 I 02 22 C2 I 

i (x2, 2/27^2) Activation of X2 

Ii = I e I a2 ^2 C2 I 

i («2; &2, C2) Internal triple of B2 

/q = I £ I £ I = £ 



Figure 14: 3DT-collapsibility of the assembling of basic blocks [Xi, X2] = var(y) and 
Y = or{X\,X2). For each 3DT-step, the three elements that are deleted from the alphabet are 
in bold, the elements that are swapped by the corresponding transposition arc undcrUncd. Vertical 
bars give the limits of the blocks in the 2-block-decomposition, and dot symbols are omitted. 
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Suppose first that A; = 0. We show that the set of external triples of (1,13) = [I',B') can be 
partitioned into valid variables, namely into A. Indeed, from the definition of each block, for each 
cr G S, cj is either part of an internal triple, or appears in a variable A ^ A. Conversely, for each 
A = [(a, 6, c), [x,y,z)] G 6, x and y appear in the block having A for output, and a, c and z 
appear in the block having A for input. Hence (a, 6, c) and (x, y, z) are indeed two external triples 



of {I,B). Hence each variable satisfies conditions (i) and (ii) of Definition 12, Conditions (iii) and 
(iv) can be checked in the definition of each block: we have, for each output variable, y -< x, and 
for each input variable, a ~< z ~< c. Finally, each Bh appears in its own behavior graph. 

Suppose now that {I',B') is obtained from {I,B) after k 3DT-steps, k > 0. Then there exists a 
3DT-instance with an /-block-decomposition {I",B") such that: 

Consider /i G [[1 ; /J . By induction hypothesis, since B'/^ is in a valid context (/", B"), then, depending 
on (dfc, efc, /fc), either B'i^ = B'l^, either there is an arc from B'l^ to B'l^ in the behavior graph. Hence 



B'j^ is indeed a node in this graph. By Property 13, we know that the set of external triples of 
{I',B') can be partitioned into variables satisfying conditions (i) and (ii) of Definition 12 Hence 
we need to prove that each variable satisfies conditions (iii) and (iv): we verify, for each node of 
each behavior graph, that x < y ^ x<\h<\y (resp. a ^ z ^ c) for each output (resp. input) variable 
A = [(a, 6, c), (x, y, z)] of the block. This achieves the induction proof. 

We finally need to consider the case where the set of variables of (I',B') is empty. Then for 
each /i G [[1 ; we either have B'^ = e, or B'^^ = ah bh Ch for some internal triple {ah, bh, Ch) (in the 
case where Bh is a block or). Then {I',B') is indeed 3DT-collapsible: simply follow in any order 
the 3DT-step for each remaining triple (ah, bh, Ch)- □ 



25 



3.3 Construction 



Let (/) be a boolean formula, over the boolean variables xi,. . . ,Xm, given in conjunctive normal 
form: (j) = Ci A C2 A . . . A C^. Each clause Cc (c S [[1 ; 7]]) is the disjunction of a number of 
literals, Xj or -iXj, i G [I ; mj. We write qi (resp. qi) the number of occurrences of the literal Xi 
(resp. -iXj) m (j), i £ ll ; m}. We also write k{Cc) the number of literals appearing in the clause Cc, 
c £ [1 ; 7]]. We can assume that 7 > 2, that for each c S [[1 ; 7]], we have k{Cc) > 2, and that for 
each z G [1 ; mj, qi >2 and qi > 2. (Otherwise, we can always add clauses of the form (xj V ^Xi) to 
4>, or duplicate the literals appearing in the clauses Cc such that k{Cc) = 1-) In order to distinguish 
variables of an /-block-decomposition from xi, . . . ,Xm, we always use the term boolean variable for 
the latter. 

The 3DT-instance I,^ is defined as an assembling of basic blocks: we first define a set of variables, 
then we list the blocks of which the word representation of is the concatenation. It is necessary 
that each variable is part of the input (resp. the output) of exactly one block. Note that the relative 
order of the blocks is of no importance. We simply try, for readability reasons, to ensure that the 
source of a variable appears before its target, whenever possible. We say that a variable represents 
a term, i.e. a literal, clause or formula, if it can be activated only if this term is true (for some 
fixed assignment of the boolean variables), or if is satisfied by this assignment. We also say that 
a block defines a variable if it is its source block. 



The construction of is done as follows (see Figure 15 for an example): 

• Create a set of variables: 

— For each i G [[1 ; mj, create qi + 1 variables representing x^: Xi and Xj, j G [[1 ; q^J, and 
qi + 1 variables representing -iXj: Xi and Xj, j G [[1 ; iji}. 

— For each c G [I ; 7II, create a variable Tc representing the clause Cc- 

— Create m + 1 variables, A^p and vl^, i G [[1 ; mj, representing the formula (p. We will show 
that has a key role in the construction: it can be activated only if (p is satisfiable, 
and, once activated, it allows every remaining variable to be activated. 

— We also use a number of intermediate variables, with names U, U, V, W and Y. 

• Start with an empty 3DT-instance e, and add blocks successively: 



For each i G [[1 ; mj, add the following q^ + q^ — 1 blocks defining the variables Xi, X\ 
(j G [1 ;(?.!), and [j £ {l ; qi}): 

[X,,Xi]=var(^;) 
[Xl Uf] = copy(X,) [Xl, Uf] = copy(X,) 



[Xf, = copy(C/2) [Xf, Ijf] = copy(C72) 

[xr',ur']=copy(ur') ■■ 
[xr-\xr] = copy(^f-^) ixr-',ur'] = copyiur') 

[xr-\xr]= copy iur') 



(*) 
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— For each c E [[1 ; 7]], let Cc = Ai V A2 V . . . V A^, with k = k{Cc)- Let each Ap, p G [[1 ; A;]], 
be the j-th occurrence of a Hteral Xi or -iXj, for some i G [1 ; mj and j G [[1 ; qi} (resp. 
j G [[1 ; Qi}). We respectively write Lp = Xj or Lp = Xj. We add the following k — 1 
blocks defining Fci 

^2 = or(Li,L2) 
V,^ = or{VlL^) 

: (**) 
Vt'=or{Vt',Lk-i) 

^k) 



— Since (/> = Ci A C2 A . . . A C^, the formula variable j4(^ is defined by the following 7 — 1 
blocks: 

VFa = and(ri,r2) 
t^3 = and(M^2,r3) 

: (***) 
W^_i = and(W^_2,r^-i) 
= and(W^„i,rO 

— The m copies A^, . . . , of are defined with the following m — 1 blocks: 

[Al,Y2]=copy{A^) 
[A%Ys]= copy{Y2) 

rm-2) 



[A^-\A^]=copy{Yr 



m— 1 J 



3.4 Main Result 



Theorem 19. Let (j) he a boolean formula, and the 3DT-instance defined in Section 3.3. The 



construction of I^f, is polynomial in the size of (j), and (p is satisfiable iff I 4, is 3DT-collapsible. 

Proof. The polynomial time complexity of the construction of is trivial. We use the same 
notations as in the construction, with B the block decomposition of I^. One can easily check, 
in ([*]), ( |**[ ), (***) and (****), each variable has exactly one source block and one target block. 

we know that {I(f„B) is a valid context, and remains so after any number of 
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Then, by Lemma 

3DT-steps, hence properties [I4j [T5j [16] and [17] are satisfied by respectively each block copy, and, or 
and var of I 4. 

=^ Assume first that (p is satisfiable. Consider a truth assignment satisfying (j): let P be the 
set of indices i G [I ; mj such that Xi is assigned to true. Starting from I^^ we can follow a path of 
3DT-steps that activates all the variables of Ia, in the following order: 
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Figure 15: Schematic diagram of the blocks defining for (j) = {xi V 2:2 V -ixs) A (xi V -1X2) A 
(-1X1 V X2 V -1X4) A (-1X1 V X3 V X4) A (x3 V -1X4) A (-1X2 V -1X3 V X4). For each variable, we draw 
an arc between its source and target block. Note that (p is satisfiable (e.g. with the assignment 
ici = X3 = true and X2 = X4 = false). A set of variables that can be activated before is 
in bold, they correspond to the terms being true in for the assignment xi = X3 = true and 
X2 = X4 = false. 
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For i G [[1 ; mj, if i £ P, activate Xi in the corresponding block var in 0. Then, with 
the blocks copy, activate successively all intermediate variables U- for j = 2 to qi — 1, and 
variables Xj for j G [1 ; qij . 

Otherwise, if i ^ P, activate Xi, all intermediate variables for j = 2 to qi — 1, and the 
variables Xf for j G [1 ; qi} 

For each c G [[1 ; 7]], let Cc = Ai V A2 V ... V A^, with k = k{Cc)- Since Cc is true with the 
selected truth assignment, at least one literal Apg, po G [l; fcj, is true. If Xpg is the j'-th 
occurrence of a literal Xi or -iXj , then the corresponding variable Lp^ {Lp^ = Xj or Lp^ = Xj ) 



has been activated previously. Using the blocks or in (**), we activate successively each 



intermediate variable Vc for p = pQ to p = k — 1, and finally we activate the variable Fc 



Since all variables Fc, c G [l; 7]], have been activated, using the blocks and in (** 



we 



activate each intermediate variable Wc for c = 2toc = 7 — 1, and the formula variable A^. 



With the blocks copy in (****), we activate successively all the intermediate variables Yi, 



i G [[2 ; m — Ij and the m copies A^, . . . , of A^. 

For i G [I ; mj, since the variable A^^ has been activated, we activate in the block var of ([*]) the 
remaining variable Xi or Xi. We also activate all its copies and corresponding intermediate 
variables U- or U- . 



For c G [I ; 7]], in (**), since all variables Lp have been activated, we activate the remaining 



intermediate variables Vf- 



At this point every variable has been activated. Using again Lemma 18 we know that the 



resulting instance is 3DT-collapsible, and can be reduced down to the empty 3DT-instance e. 

Hence is 3DT-collapsible. 

<^ Assume now that is 3DT-collapsible: we consider a sequence of 3DT-steps reducing 
to e. This sequence gives a total order on the set of variables: the order in which they are activated. 
We write Q the set of variables activated before A^, and P C [I ; m| the set of indices i such that 



Xi £ Q (see the variables in bold in Figure 15). We show that the truth assignment defined by 
(xj = true 4^ i £ P) satisfies the formula (p. 



For each i G [I ; mj, A^^ cannot belong to Q, using the property of the block copy in (**^ 
(each can only be activated after A^). Hence, with the block var in Q, we have 
Xi G Q =^Xj ^ Q. Moreover, with the block copy, we have 

Vl<j<gi, xieQ^XieQ (a) 
Vl<j<g„ xi gQ^ XiGQ^ Xi(^Q (b) 



Since A^ is defined in a block A^ = and(VF^_i, F^) in (***), we necessarily have PF-^-i G Q 



and F^ G Q. Likewise, since W^-i is defined by W^_i = and(VF^_2, F^_i), we also have 
W-y^2 G Q and r^_i G Q. Applying this reasoning recursively, we have Tc G Q for each 
cG[l;7l. 
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For each c G [1 ; 7I, consider the clause Cc = Ai V A2 V ... V Xk, with k = k{Cc)- Using the 



property of the block or in (**), there exists some po £ [l ; k} such that the variable Lp^ is 
activated before Tc- hence Lp^ £ Q. If the corresponding literal Ap^ is the j-th occurrence 
of Xi (respectively, -<Xi), then Lp^ = Xj (resp., Lp^ = Xj), thus by ([a]) (resp. Q), Xi G Q 
(resp., Xi ^ Q), and consequently i £ P (resp., i ^ P). In both cases, the literal Xpg is true 
in the truth assignment defined by {xi = true <^ i £ P). 

If is 3DT-collapsible, we have found a truth assignment such that at least one literal is true in 
each clause of the formula 6, and thus d) is satisfiable. □ 
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4 Sorting by Transpositions is NP-Hard 



As noted previously, there is no guarantee that any 3DT-instance / has an equivalent permutation 
TT. However, with the following theorem, we show that such a permutation can be found in the 
special case of assemblings of basic blocks, which is the case we are interested in, in order to 
complete our reduction. 

Theorem 20. Let I be a 3DT-instance of span n with B an I -block- decomposition such that {I,B) 
is an assembling of basic blocks. Then there exists a permutation ttj, computable in polynomial 
time in n, such that / ~ vr/. 

An example of the construction of tt/ for the 3DT-instance defined in Example [2] is given in 



Figure 16 



Proof. Let A be the set of variables of the /-block-decomposition B I = (S, T, Let n be the 
span of /, and L its domain. Note that L = [[1 ; nj. For any h £ [[1 ; /J, we write m{Bh) (resp. 
no{Bh)) the number of input (resp. output) variables of Bh. We also define two integers PhiQh by: 

Pi = 

V/i G [1 ; /J , qh=Ph + th- Sh + 3{ni{Bh) - no{Bh)) 
V/i G [2 ; /I , ph = Qh-i 

The permutation vr/ will be defined such that ph and have the following property for any h G 
ll; If. T^i{sh) = Ph, and TTi{th) = Qh- 

We also define two applications a, /3 over the set A of variables. The permutation tt/ will 
be defined so that, for any variable A = [(a, 6, c), (x, y, z)], we have 7rj{ip{a) — 1) = a{A) and 
7ri{ip(z) — 1) = f3{A). In order to have this property, a and (3 are defined as follows. 

For each h £ {1 ; Ij: 

• If Bfi is a block of the kind [Ai, A2] = copy (A), define 

a{A) =ph, ^{A)=ph + A. 

• If Bh is a block of the kind A = and(^i, ^2), define 

a{Ai) = Ph, =ph + 7, a{A2) = + 3, /3(^2) =Ph + 9. 

• If Bfi is a block of the kind A = or{Ai, A2), define 

aiAi) = Ph, f3{Ai) =ph + 13, a{A2) =Ph + ^, /3(^2) =Ph + 16. 

• If Bh is a block of the kind [^1, ^2] = var(^), define 

a{A) =ph + 5, p{A) =Ph + 9. 

Note that for every A £ A, a{A) and f3{A) are defined once and only once, depending on the 
kind of the block Btarget{A)- The permutation ttj is designed in such a way that the image by tt/ 
of an interval ^Sh + 1 ; t/ij is essentially the interval Iph + 1 ; Qhl ■ However, there are exceptions: 
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namely, for each variable A, the integers a{A) + l,a{A) + 2, 13(A) + 1, which are included in 
lPtarget{A) + 1 ] qtarget{A)l , are in the image of lssource{A) + 1 ] tsource{A)l ■ TMs is formally described 
as follows. For each h £ [[1 ; kj we define a set Ph by: 

Ph = Iph + 1 ; qhj U U {a{A) + l,aiA) + 2,f3iA) + l} 

A output of Bh 

\J {aiA) + l,aiA) + 2,l3{A) + l} 

A input of Bh 

We note that the sets {a{A) + l,a{A) + 2, 13(A) + 1} are distinct for different variables A, and 
are each included in their respective interval lptarget{A) + 1 ! QtargetiA)}- Hence for any /i S [l ; /]], 
we have \Ph\ = Qh — Ph + 3no(;B/i) — 2>ni{Bh) = th — Sh- Moreover, the sets Ph, h £ [1 ; ^1, form a 
partition of the set [[1 ; nj. 

We can now create the permutation ttj. The image of is 0, and for each ho from 1 to we 
define the restriction of ttj over Ishg + 1 ; i/iol as a permutation of PhQ, with the constraint that 
T^iitho) = Qho- Note that, if this condition is fulfilled, then we can assume Tri{sho) = Pho^ since, if 
ho = 1, 7r/(si) = 7r/(0) = = pi, and if ho > 1, vr/(s/io) = '^li.tho-i) = Qho-i = Pho- 

The definition of ttj over each kind of block is given in Table [T] This table is obtained by 
applying the following rules, until 7r/(n) is defined for all u G Ishg + 1 ; i/iol- 



VA = [(o, b, c), (x, y, z)] input variable of 

TTi{i^iz)) = aiA) + 3 (Ri) 

7r,(V(c)) = /3(A) + 2 (i?2) 
= [(a, b, c), (x, y, z)] output variable of Bh^ 

7rj{i;{x)) = /3(A) + 1 (i?3) 

TTl{ij{b)) = a{A) + 1 (i?4) 
Vii G [[s/iQ + 1 ; i/ij such that succj^{u) £ {sho + 1 ; i/iol 

7r/(u) = 7r7(sncc7"'^(u) - 1) + 1 (iJs) 



We can see in Table [T] that rules (Ri) and (R2) indeed apply to every input variable, and 
rules (R3) and {R4) apply to every output variable. Moreover: 



(^'i) 



Rule ( I-Rsi ) applies to every u G [[shg+i ; th^} such that 
u ^ {ip(b) , iljic) , ip{x) , il^iz) I A = [{a,b,c), {x,y,z)] input/output of Bhq}. 

A simple case by case analysis shows that the following properties are also satisfied. 

TTj defines a bijection from Ish^ + 1 ; t/ioJ to Ph^ such that Tri{tho) = Qho 



{P2) 
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Table 1: Definition of tt/ over an interval Is^^ + 1 ; ^hol) where Bh^^ is one of the blocks copy, and, 
or, var. We write s = s/j^ and p = ph^- We give the line tl^~'^{u) as a reminder of the definition of 
each block. We also add a column for = s as a reminder of the fact that Trj{s) = p. 

• If BhQ is a block of the kind [^1,^42] = copy(A), we write ai, Pi,a2, P2 the respective values 
ofa{Ai),/3{Ai),a{A2),/3{A2). 

u = s s+1 s+2 s+3 s+4 s+5 s+6 s+7 s+8 s+9 s+10 s+11 s+12 
7Ti{u) = p ai+2 p+8 p+A p+3 0:2+2 p+7 /3i+l ai+1 p+6 /32+1 02+! P+9 
V'~^(ii)= a yi e ^; d 2/2 61 c X2 62 / 

• If is a block of the kind A = and(Ai, A2), we write a, /3 the respective values of a{A), P{A). 

u = s s+1 s+2 s+3 s+A s+5 s+6 s+7 s+8 s+9 s+10 s+11 s+12 
7r/(u) = p p+14 p+7 p+S p+13 p+9 p+6 a+2 p+12 p+11 /3+1 a+1 p+15 
'i/j~^{u)= a\ e zi a2 ci Z2 d y C2 X b f 

• If BhQ is a block of the kind A = or{Ai, A2), we write a, j3 the respective values of a{A), j3{A). 

u = s s+1 s+2 s+3 s+4 s+5 s+6 s+7 s+8 s+9 
7rj{u) = p p+7 p+13 p+3 p+9 a+2 p+12 p+11 P+1 a+1 
iIj~^{u)= ai b' zi a2 d y a' X b 

u =s+10 s+11 s+12 s+13 s+14 s+15 

7rj{u) =p+16 p+6 p+15 p+10 p+8 p+18 
tp~^{u)= f Z2 ci e d C2 

• If Bha is a block of the kind [yli, ^12] = var(yl), we write cki, a2, ^2 the respective values of 
a(^i),/3(^i),a(^2),/3(^2). 

u = s s+1 s+2 s+3 s+4: s+5 s+6 s+7 s+8 s+9 
7r/(?i) = p ai+2 p+5 p+3 a2+2 p+12 p+1 p+14 p+4 (3i+l 
ip^^{u)= di yi a d2 2/2 ei a' 62 xi 

u =s+10 s+11 s+12 s+13 s+14 s+15 s+16 s+17 s+18 
TTi{u) =ai+l p+13 p+9 p+8 p+2 p+11 ^2+1 "2+1 P+15 

^-l(ti)= hi fi d Z y C X2 62 /2 
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\/A = [(a, b, c), (x, y, z)] input variable of BfiQ, 

TTiiij{a) - 1) = a{A) 

7Tj{ij{z) - 1) = p{A) 
yA = [(a, b, c), (x, y, z)] output variable of Bh^, 

TTi{tP{y)-l) = a{A) + 2 

TTiim - 1) = + 1 



(a) 

(^6) 



Now that we have defined the permutation ttj, we need to show that ttj is equivalent to /. 
Following Definition 10 we have 7r/(0) = 0. Then, L = [[1 ; nj, so let us fix any u G [[1 ; nj, and 
verify that 7r/(n) = 7rj{succJ^{u) — 1) + 1. Let h be the integer such that u E Ish + 1; th}- 

First consider the most general case, where there is no variable A = [{a,b,c), {x,y, z)] such 
that u £ {tp{b) , iIj{c) , ip{x) , ip{z)} . Note that this case includes u = ip{d), where d is part of any 



internal triple. Then, by Property (Pi 
have 7r/(n) = 7rj{succJ (u) — 1) + 1. 

Suppose now that, for some variable A 



we know that Rule (iis) applies to u, hence we directly 



[(o,6,c), {x,y,z)], we have e {'ilj{b),ij{c),tp{x),i;{z)}. 



Then Rules (i?i) and (R2), and Properties iPsh and (P4) apply in the target block of A. Also 



Rules (i?3) and (i?4), and Properties (P5) and (Pq) apply in the source block of A. Combining all 



these equations together, we have: 






TTiitPib)) = a{A) + 1 


= 7r/(V'(a) 


-1) + 1 


7rj{^P{c)) = P{A) + 2 


= TTiiipib) 


-1) + 1 


7ri{iP{x)) = p{A) + 1 


= TTi{-4>{z) 


-1) + 1 


tti{jP{z)) = a{A) + 3 


= 7r/(V'(y) 


-1) + 1 



by (P3) and (P3) 



by {R2] and (P5) 



by (P4) and (P4) 



by ^ and ^ 



For u = il){b) (resp. '4}{c),iIj{x)^iI){z)), we have succ^^{u) = il^{a) (resp. ^{b)^'il}{z)^'il){y)). 
Hence, in all four cases, we have tti{u) = ■ki{succJ^{u) — 1) + 1, which completes the proof that tt/ 
is equivalent to /. 

□ 

With the previous theorem, we now have all the necessary ingredients to prove the main result 
of this paper. 

Theorem 21. The SORTING by Transpositions problem is NP-hard. 

Proof. The reduction from SAT is as follows: given any instance (j) of SAT, create a 3DT-instance 
I^, being an assembling of basic blocks, which is 3DT-collapsible iff (j) is satisfiable (Theorem 19). 



Then create a 3-permutation vr/^ equivalent to (Theorem 20). The above two steps can be done 
in polynomial time. Finally, set k = dh{TTj^)/3 = n/3. We then have: 

(j) is satisfiable 4^ I^p is 3DT-collapsible 

4^ dtiiTj^) = k (by Theorem [9| since vr/^ ~ I^) 
dt{T^u) < k (by Property 11). 



□ 
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[Xi,X2] =var(r) 

Input variable (target of): Y 
Output variables (source of): Xi,X2 

si^O ti = 18 

pi=0 = + 18 - 3 = 15 

a{Y) =pi + 5 = 5 
/3(r) = pi + 9 9 


r = or(Xi,X2) 

Input variables (target of): Xi,X2 
Output variable (source of): Y 

S2 = 18 t2 = 33 

P2 — 15 52 = P2 + 15 + 3 = 33 

a{Xi)=p2 = 15 
/3(Xi) =p2 + 13 = 28 
a{X2) = J52 + 3 = 18 
/3(X2) = J52 + 16 = 31 


u = 
7r/(u) = 
^-\u) = 


Definition of tt/ over Jsi + 1 ; ii] : 
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 
17 5 3 20 12 1 14 4 29 16 13 9 8 2 11 32 19 15 

di yi a d2 2/2 Ci a' 62 Xi hi fi c' z b' c X2 62 /2 


Definition of tt/ over [[s2 + 1 ; f2l : 
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 
22 28 18 24 7 27 26 10 6 31 21 30 25 23 33 
Oi b" zi a2 d y a" X b f Z2 c\ e c" C2 



Figure 16: Creation of a permutation vr/ equivalent to the assembling of basic blocks I = (S, T, ^) 
of span 33 defined in Example [2| following the proof of Theorem 20 



Note that the permutation vr/ defined by Theorem[20]is in fact a 3-permutation, i.e. a permuta- 
tion whose cycle graph contains only 3-cycles [3j (which is equivalent to saying that the application 
succ defined by succ(u) = TTj^{7rj{u — 1) + 1) has no fixed point, and is such that succo succo succ 
is the identity). Moreover, the number of breakpoints of vr/ is dh{T^i) = n. Hence we have the 
following corollary. 

Corollary 22. The following two decision problems are NP-hard: 

• Given a permutation vr 0/ [[0 ; nj, is the equality dt^Tr) = db{'JT)/3 satisfied? 

• Given a 3-permutation vr 0/ [0 ; nj, is the equality dt{ir) = n/3 satisfied? 

Conclusion 

In this paper we have proved that the Sorting by Transpositions problem is NP-hard, thus 
answering a long-standing question. However, a number of questions remain open. For instance, 
does this problem admit a polynomial time approximation scheme? We note that the reduction 
we have provided does not answer this question, since it is not a linear reduction. Indeed, by our 
reduction, if a formula (j) is not satisfiable, it can be seen that we have dt{TTi^) = dh{TTj^)/3 + 1. 

Also, does there exist some relevant parameters for which the problem is fixed parameter 
tractable? A parameter that comes to mind when dealing with the transposition distance is the 
size of the factors exchanged (e.g., the value max{j — i,k — j} for a transposition Tij^k)- Does the 
problem become tractable if we bound this parameter? In fact, the answer to this question is no if 
we bound only the size of the smallest factor, min{j — i,k — j}: in our reduction, this parameter 
is upper bounded by 6 for every transposition needed to sort tt/, , independently of the formula (p. 
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